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ABSTRACT (CONTINUED) 

Three ERTS-1 images from the Phoenix, Arizona area were processed, and 
recognition rates between 85% and 100% were obtained for the terrain 
classes of desert, farms, mountains and urban areas. 

To eliminate the need for training data, a new clustering algorithm 
has also been developed. Seven ERTS-1 images from four test sites 
have been processed through the clustering algorithm, and high recog- 
nition rates have been achieved for all terrain classes. 
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PREFACE 


A. OBJECTIVE 

The main objective of the investigation is to develop digital in- 
terpretation techniques such that earth resources can be recognized 
automatically from ERTS-1 images. Emphasis has been placed on three 
characteristics of the interpretation techniques: accuracy of recog- 

nition, efficiency and practicality of generating thematic maps and 
automation of the interpretation functions. 

B. SCOPE OF WORK 

Initial analysis of ERTS-1 computer compatible tapes showed that 
the data could not be used in its original form, and a major portion 
of the investigation's effort had to be directed into reformatting, re- 
sampling and filtering of the data. These preprocessing operations 
were time consuming. 

A photointerpretation task over one test site was completed and 
photographic methods for exploiting the ERTS imagery and correlating 
it to aerial photography and ground truth maps were developed. 

A diffraction pattern analysis of ERTS imagery disclosed the 
existence of spatial signatures for cultivated land, urban areas and 
mountains. The spatial signatures were then exploited in the develop- 
ment of digital spatial features that carry the necessary spatial 
information to identify terrain types accurately. This was demonstrated 
by developing an heuristic algorithm which operated on spatial feature 
vectors from ERTS Image 1049-17324-5. 

Later, the spatial features were combined with spectral features 
and the maximum likelihood criterion was employed in the terrain recog- 
nition algorithm. ERTS Image 1049-17324 was processed using this 
algorithm and the results provide useful comparisons between the spatial 
and spectral features and between the maximum likelihood criterion and 
the heuristic algorithm for which the decision boundaries are hyperplanes. 

An analysis of terrain class statistics showed that the statistics 
were not Gaussian and were reducing substantially the performance of the 
maximum likelihood criterion. Then, non-linear transformations of the 
feature vectors were devised which increase the accuracy of recognition 
results. 
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Three ERTS images from the Phoenix, Arizona test site were processed 
through the recognition algorithm and the results showed the seasonal 
variations in terrain class statistics as well as the physical seasonal 
changes in the test site. 

A new clustering algorithm was developed so that training data will 
not be required to determine the terrain class statistics. The clustering 
algorithm was applied on seven ERTS images from four test sites and the 
results were excellent indicating that the algorithm will operate satis- 
factorily on various ERTS data without any a priori information. One of 
the ERTS images processed through the clustering algorithm provides a 
comparison between the clustering algorithm and the maximum likelihood 
criterion with training data. 

The clustering algorithm utilizes the divergence criterion to deter- 
mine the statistical separability of potential clusters. This criterion 
was analyzed and found to be an accurate criterion of separability between 
terrain classes provided the class statistics were approximately Gaussian. 

Six of the ERTS images processed through the clustering algorithm were 
from three test sites, and the results showed the seasonal changes in the 
terrain classes. 

C. CONCLUSIONS 

The major accomplishment of this investigation is the development of a 
system of algorithms which can be applied on ERTS data for the automatic 
recognition of earth resources. 

The experimental results show that the major terrain types such as 
desert, mountains, urban areas, farmland etc. can be recognized with an 
average accuracy of 90%. 

The use of the maximum likelihood criterion in the clustering algorithm 
that was developed has eliminated the need for training data and nearly 
complete automation has been achieved since a human operator is only required 
to oversee the recognition process. 

The clustering algorithm has been applied to digital data from seven 
ERTS images from the States of Arizona, California, Washington and Utah. The 
geographical areas represented in these images have different terrain 
characteristics; yet the major terrain types were recognized with accuracies 
varying between 70 and 99 percent. This is an indication that the clustering 
algorithm can identify major terrain types accurately regardless of the 
geographical area represented by the data. In addition, the clustering 
algorithm will produce sub-classes of the major terrain types in certain 
geographical regions whenever the sub-classes can be statistically separated 
in the feature vector space. For example, the clustering algorithm identified 
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two classes of water in both the Salton Sea and the Great Salt Lake, three 
types of desert in the Imperial Valley, two types of farms in the States of 
Utah, Washington and California, etc. Most sub-classes were recognized 
with accuracies varying between 77 and 100 percent. 

Another important accomplishment of this investigation was the demonstra- 
tion that spatial features obtained by selective sampling of the Fourier 
transforms of small areas of ERTS images contain sufficient information to 
recognize major terrain types. Recognition accuracies varying between 74 
and 97 percent were obtained in the Phoenix, Arizona area using only spatial 
features from the Fourier transforms of cells with 32 x 32 picture elements 
from the MSS 5 spectral band. When the spatial features were combined 
with spectral features, the recognition accuracy of the urban areas increased 
from 74 to 95 percent. 

D. RECOMMENDATIONS 

It is recommended that the software system and the pattern recognition 
techniques developed under this investigation be applied on many problems 
of national importance for which remote sensing of the earth by the ERTS-1 
satellite can provide very valuable information. Two such problems have been 
identified for the proposed ERTS-B investigation (see Abstract in Section 8). 
In addition, the following applications are suggested as being potentially 
of high economic value: 

a. Monitoring of agricultural crops and forest resources to detect 
crop stresses and estimate crop yield . 

b. Uniform land use mapping for the entire country. 

c. Monitoring of snow fields to predict flooding and estimate 
water resources. 

d. Detection of water pollution. 
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1. INTRODUCTION 


This is the final report for this investigation and covers the technical 
work done and the results achieved under contract NAS5-21766. The report is 
intended to provide a concise summary of the accomplishments of this investi- 
gation and contains material taken from many other reports and papers pub- 
lished during the course of this investigation. Most of the material 
originated from the following reports and papers: 

1. Type II report of December 1972 {Ref. 11) . 

2. Type II report of August 1973 (Ref. 12). 

3. Technical report of December 1972 (Ref. 1). 

4. Technical report of March 1973 (Ref. 3) . 

5. Paper given to ERTS Symposium in March 1973 (Ref. 4). 

6. Paper given to ASP. Fall Conference in Buena Vista , Florida 
in October 1973 (Ref. 13) . 

7. Paper given to ASP Remote Sensing Symposium in Sioux Falls, 

South Dakota in October 1973 (Ref. 14). 

8. Paper submitted to ARETS Conference Proceedings in Tuscon, 

Arizona in November 1973 (Ref. 15) . 

9. Paper given to ERTS Symposium in December 1973 (Ref. 16). 

The main body of the report is divided into the following sections: 

A. Input Data Operations, which describes the data handling opera- 
tions necessary for the utilization of the data received from NASA. 

Particular attention is given to the ERTS Digital Tapes (Bulk CCT's) for 
which the digital preprocessing operations constitute a major part of the 
work done under this investigation. 

B. Photointerpretation Tasks, which are necessary for the location 
of ground truth, familiarization with the test sites and the ERTS images 
and for evaluating the digital terrain type recognition operations. 

C. Terrain Spatial Signatures. This section describes the work 
done to isolate signatures for the various types of terrain. The signa- 
tures that have been developed and means for efficiently measuring them 
are also described. 
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D. Terrain Type Recognition. This is an extensive section des- 
cribing algorithms for recognizing terrain and the results obtained when 
these algorithms were employed. The algorithms utilized both multispectral 
and spatial data. The clustering algorithm is included in this section. 

E. New Technology Report. 

F . Cone lus ions. 

G. Potential Applications and Significant Results. 

H. Recommendations. 
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2 . 


INPUT DATA OPERATIONS 


2.1 ERTS Images 

Since February 1, 1973, a substantial number of 9-1/2 inch positive 
and color transparencies has been delivered to the project by NDPF. A 
system of checking and cataloging these images has been developed so that 
they can be utilized by the project efficiently. 

Through the standing orders, the project receives black and white 
positive transparencies on 9-1/2 inch film. The images may contain a 
portion or an entire test site and are the first indication that a test 
site has been acquired on a specific date. Upon receipt of the trans- 
parencies, an overlay is produced photographically. The overlay contains 
all the annotations and coordinate marks of the MSS 5 (Red Band) image. 
Georgraphic coordinates and the test site coordinates are plotted on the 
overlay. (See Figure 2-1.) Using the overlay, it is possible to deter- 
mine the percentage coverage of the test site. 

At this point, a decision is made whether to submit a request for 
digital tapes. The decision is based on cloud cover, haze, percentage 
of test site photographed, as well as on previous acquisitions of the 
test site. 

The ERTS images are also being utilized for the photo interpretation 
tasks (section 3) and for the development of terrain spatial signatures 
(section 4). Both of these operations require changes in scale which 
are accomplished very efficiently photographically. The photo interpre- 
tation tasks require enlargements while the spatial signatures require, 
high resolution reductions in scale in order to obtain large diffrac- 
tion patterns. 

t 

2.2 ERTS Digital Tapes 

A number of digital operations are required to utilize the data 
available in the bulk Computer Compatible Tapes (CCT's). Each computer 
compatible tape delivered by NDPF contains one-quarter of an ERTS image. 
The four spectral bands (for an MSS image) are interleaved. In addition, 
the sampling interval (in meters) along a scan line is smaller than the 
sampling interval between scan lines. In other words, there are more 
picture elements along a kilometer on the surface of the earth parallel 
to a scan line (approximately east-west direction) than a kilometer nor- 
mal to a scan line (approximately north -south direction) . Furthermore, 
successive lines are slightly shifted to each other, owing to the earth's 
rotation in relation to the satellite velocity. 
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The first preprocessing, operation requires the separation of the 
MSS 4, 5, and 7 spectral bands into separate digital images consisting 
of 810 samples per scan line and 1,000 scan lines each. The 1,000 scan 
lines are selected from each CCT (which contains 2,340 lines) so that 
they cover approximately the north-south extent of a test site. To 
cover the east-west extent of a test site, it is necessary to combine 
data from two adjacent CCT's. So, the first preprocessing operation 
involves stripping data from two CCT's and producing six images (two 
in each spectral band), containing 810 x 1,000 samples each. 

The second preprocessing operation combines adjacent images, so 
one is left with three images (one in each spectral band) containing 
1,620 x 1,000 samples each. These images have a major distortion which 
is an enlargement of the images in the scanning direction (east-west, 
approximately) and results from the unequal sample spacing in the scan- 
ning and raster directions. Visually, the images look like oblique 
rather than vertical photographs. This image distortion causes arti- 
ficial variations in the spatial signatures of the terrain types. For 
example, a square farm would appear as a rectangle or a skewed paralel- 
logram depending on its orientation to the scanning direction. It is 
necessary, therefore, to resample each image in the scanning direction 
in order to equalize the sample spacing in the two orthogonal directions. 

The third preprocessing operation involves resampling of the images 
in the scanning direction and reducing the samples from 1,620 samples per 
line to 1,170 samples per line. After this resampling operation, the 
scale of the images is approximately the same in all directions. 

The fourth preprocessing operation records the digital images as 
transparencies in a laser beam recorder for visual examination. This 
is necessary since many images display missing lines and a few images 
have pronounced line structure. The line structure with periodicity 
of six lines seems to arise from inaccurate calibration of the MSS de- 
tectors (there are six detectors per spectral band) . 

The fifth preprocessing operation involves printing the digital 
data from selected scan lines to ascertain that the artifacts are ac- 
tually present in the data . 

The sixth preprocessing operation is the correction of the artifacts. 
Missing lines are replaced by the average of the two adjacent lines. The 
line structure is corrected by recalibrating the detectors. Each detector 
contributes one sixth the data (every sixth line) of each image. By in- 
tegrating the data from each detector over the entire image, an estimate 
of the calibration errors can be made . First, the samples along each scan 
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line are summed up. Let the sum of samples along the jth line be s(j) 
where : 


j = 1, 2, .... (996) 


(2-1) 


Then, one forms the six sums 


165 

S = Z s(6k + i) = s(i) + s(6 + i) + . . . s{990 + i) (2-2) 

k=0 

where i = 1, 2, 3, 4, 5, and 6. 

S is the sum of all the samples contributed by the ith detector. If 
all detectors were correctly calibrated, then: 


Si - S 2 = S 3 = S4. = S5 = Sfc 


(2-3) 


To correct the detector calibration, the samples of the scan lines num- 
bered (6k + i) where: 


k — 0, 1, 2, • . . 166 and 1 — 1, 2, . . .6 


(2-4) 


are multiplied by the ratio (Si/S^) . 

The correction in the calibration achieved by this method is only 
approximate but has worked very satisfactorily whenever it was applied. 

It is obvious that the preprocessing operations on the digital data 
are quite extensive, involved and time consuming. According to the Modi- 
fied Data Analysis Plan, data from nine ERTS images has been preprocessed 
and is stored on magnetic tapes in the form of 27 separate images. These 
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are being used for the terrain type recognition described in section 5* 


The preprocessing operations have been completed for the following 
ERTS-1 images: 


1031-17325 

1049-17324 

1103-17332 


from Phoenix r Arizona 
from Phoenix, Arizona 
from Phoenix, Arizona 


1040-18201 from Cascade Mountains 
1077-18260 from Cascade Mountains 


1015-17415 from Salt Lake, Utah 

1069- 17420 from Salt Lake, Utah 

1070- 17495 from Salton Sea, California 
1034-17500 from Salton Sea, California 


Figure 2-2 is an example of a preprocessed image that was recorded as a 
transparency* 


2.3 Aircraft Photography 

Low altitude photography (10,000 feet) and high altitude photography 
(^60,000 feet) from all the test sites has been delivered by NASA (the 
Johnson Spacecraft Center or the Ames Research Center). In general, the 
photography is very good and is being fully utilized in the photointerpre- 
tation tasks (section 3) . 

The handling operations on the aircraft images are all photographic 
and involve making prints, enlargements and transparencies. (See, also, 
reference 1.) 

The aircraft photography is the primary source of ground truth for 
the test sites. Without it, it would be impossible to determine the ac- 
curacy of the digital terrain recognition results. 

3 . PHOTOINTERPRETATION TASKS 


Each test site area has been subjected to photointerpretation for 
the purpose of determining ground level conditions. Aircraft photog- 
raphy is the primary data base for the photo interpretation tasks, and 
it is supplemented by maps of the test sites. 

The photointerpretation involves making overlays and comparing the 
ERTS-1 images to the high altitude and the low altitude photography. 
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The photo interpretation results are also compared to the digital 
interpretation results in order to determine the accuracy of the terrain 
recognition operations- (See Reference 1.) 

4. TERRAIN SPATIAL SIGNATURES 

In the ERTS-1 images , the basic unit which conveys information is 
the picture element. Each picture element corresponds to an area incre- 
ment on the surface of the earth equal to the instantaneous field of view 
squared (about 80 meters square) - Each area increment is characterized 
by a brightness value in each spectral band. These brightness values are 
the multispectral information conveyed by the picture element. 

The first level of automatic interpretation is the classification 
of individual picture elements based on multispectral information. This 
level appears to be well developed as evidenced by the wide use of multi- 
spectral recognition software based primarily on the maximum likelihood 
criterion. 

Most of the information in the ERTS images is contained in the dis- 
tribution of the picture elements over the image area. This spatial in- 
formation may be conveniently divided into the information carried by 
small localized areas defined as cells as well as the information carried 
by the distribution of the cells over the image. 

The cell size that has been employed in this investigation is 32 x 
32 picture elements which corresponds to about 2.5 x 2.5 kilometers on 
the surface of the earth. Each cell, therefore, is characterized by 
1,024 picture elements. 

Two questions, then, arise about each cells 

a. Which objects on the surface of the earth it might represent? 

b. How could these objects be identified by manipulating these 
1,024 numbers? 

The first question is answered by the IFOV (80 meters) of the MSS 
scanner. The objects that are faithfully reproduced by this system are 
expected to be in the order of one kilometer. Examination of the ERTS 
photography shows that the objects that can be identified are various geo- 
graphical features and terrain types, such as bays, lakes, rivers, 
mountains, urban areas, cultivated land, desert, forest, etc. 

The second question is more difficult to anwer. It is desirable 
to develop signatures which uniquely characterize the terrain categories. 
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A signature is defined as a specific pattern of numbers. 

It is also desirable that each terrain class be characterized by 
a signature containing less than ten numbers. Assume that such signa- 
tures were known. Then, to assign a cell to one of the terrain types, 
the 1,024 numbers describing the cell are combined to produce a vector 
with less than ten components which is known as a spatial feature vec- 
tor. This vector is compared to the spatial signatures, and the cell 
is assigned to the terrain category whose signature most closely resem- 
bles the vector of the cell. Conceptually, the assignment of cells to 
terrain categories is simple, but there are some difficult practical 
problems in accomplishing it, such as: 

a. The 1,024 numbers of the cell are quite redundant. Their re- 
placement by a feature vector with less than ten components 
is a data reduction operation which may actually destroy in- 
formation. 

b. There is no theoretical treatment that allows one to select 
optimal terrain signatures. 

c. Comparison of a feature vector to the spatial signatures re- 
quires the use of a criterion of "closeness". If the cell is 
to be assigned to the most probable terrain class, the sta- 
tistics of each class must be known. 

As far as is known, there have been two approaches for the deter- 
mination of spatial signatures. One of these is to employ well known 
mathematical functions {as components of the feature vector) , such as 
the variance of the cell numbers, and decide later by the results of 
the recognition whether these mathematical functions are reliable indi- 
cators of terrain type. The results of this approach have not been 
very satisfactory. The other approach consists of examining the Fourier 
transforms of cells and identifying patterns which are unique to each 
terrain type. This is the approach followed in this investigation. 

4. 1 Diffraction Pattern Analysis 

4.1.1 Experimental Procedure 


In the ERTS-1 images, the terrain types that one would like to rec- 
ognize consist of: 


Mountains 
Desert or range 
Cultivated land 


Clouds 
Urban areas 
Hills 


Rivers 

Bodies of water 
Transportation networks 
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Fourier transforms of images displaying one terrain type provide a 
means for isolating terrain signatures. 

The Fraunhofer diffraction pattern of an image is related to its 
Fourier transform. It is well known that, if a transparency is intro- 
duced into the front focal plane of a lens and illuminated by a coherent 
plane parallel beam, the lens will form an image at its back focal plane 
whose amplitude distribution is the Fourier transform of the amplitude 
transmission of the transparency. Specifically, * 


Vv y i 


) = 


A 

jAf 


ff 


+ » 


t (X 

a o 


2 yr 

y 0 )e*p[-j s -( Vf 


y o y f 1 )d V y o 


(4-1) 


where U^(x^,y f ) is the amplitude distribution at the back focal plane, 
X f'^f are the coordinates at the back focal plane, X is the wavelength 

of light, f is the focal length of the lens, A is the amplitude of the 
illuminating beam, and t^x^y^) is the amplitude transmission of the 

transparency. The diffraction pattern D(x ,y ) is the amplitude squared 
Of Uf(x f ,y f ): 


D(x f ,y f ) =U f (x f ,y f ). U*(x f ,y f ) = |u(x f ,y f )| 2 


In additions 


t (x ,y ) = [T(x ,y 1 1 / 2 e + ^ (X o /y o ) 


(4-3) 


where T(x^,y^) is the intensity transmission of the transparency for 
coherent light and <Mx o ,y Q ) is a phase function due to emulsion thick- 
ness that is unrelated to the image. 


*J. W. Goodman, Introduction to Fourier Optics, McGraw-Hill Book 
Co., Inc., New York (1968), p. 86. 
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It is obvious from the above equations that the diffraction pattern 
is not equal to the Fourier transform of the image. However, the diffrac- 
tion pattern is useful because it has the same general structure as the 
Fourier transform. 

Noise or errors in the diffraction pattern associated with the phase 
function *<x o ,y o ) can be eliminated by inserting the transparency in a 
liquid gate filled with a refractive index matching fluid. 

The optical bench employed is shown in Figure 4—1. The laser has a 
wavelength of 632.8 nanometers and its beam is expanded and filtered to 
P^c**iuee a beam that is Gaussian in intensity. The primary lens forming 
the diffraction pattern has a focal length of 48 inches, to produce large 
<iif faction patterns. The frequency scale of the diffraction pattern is 
proportional to the focal length of the lens and the wavelength of illu- 
mination. 

To reduce the overall dimensions of the bench, the transparency is 
located near the lens rather than on its front focal plane. The result 
is that a quadratic phase factor now multiplies U (x ,y ) but does not 
affect the diffraction patterns. f f f 

Liquid gates are not employed because they are inconvenient when 
diffraction patterns from a large number of images are to be obtained. 

The optical bench permits photographing of the diffraction patterns and 
the images simultaneously. It also allows photometric measurements of 
any part (rings or wedges) of the diffraction patterns. Such measure- 
ments of diffraction patterns have been obtained. 

The ERTS images selected are described in Section 4.1.2. The images 
initially employed were positive transparencies on 9-1/2-inch-wide film. 
They produced small diffraction patterns. It was decided, therefore, to 
reduce the scale of the ERTS-1 images by a factor of three and develop 
the reduced images to a gamma of two to compensate for the square root 
factor in Equation 4-1* The maximum resolution in the reduced transpar- 
encies was estimated at about 25 line pairs per millimeter. 

The diffraction patterns have some artifacts not related to the ter- 
rain images: 

a. Rings due to the circular aperture employed to limit the image 
area being transformed; these rings are also present in a digi- 
tally computed Fourier transform. 

b. Two frequency spots due to the line structure present in the 
ERTS images; the line structure is a characteristic of the 
mu ltispectral scanner and electron beam recorder system . 
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Fig. 4-1 — Coherent optical bench 
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c. Artifacts due to the optical bench, such as lens aberrations, 
laser beam nonuniformity, phase modulation by the transpar- 
encies . 

d. The central order is so bright that it usually saturates the 
film over a sizable area surrounding the central order and 
masks low frequency components. 

To eliminate or suppress artifacts in the diffraction patterns, a 
mask is employed when photographing the diffraction patterns. The mask 
is itself the photograph of the diffraction pattern of an image area 
(from ERTS images) with no detail, such as areas of water from lakes or 
the ocean. 

Figure 4-2 provides a comparison of a typical diffraction pattern 
with and without the mask. It is obvious that the mask enhances the low 
frequency structure of the diffraction pattern and suppresses artifacts. 
For this reason, these masks have been used consistently. 

Rings in the diffraction patterns are associated with the aperture 
only, not with image detail within the aperture. The spacing (d) be- 
tween adjacent dark bands in the Airy disk is d = 1.22Af/a, where A is 
the wavelength of light, f is the focal length of the lens, and a is 
the aperture diameter. 

Increasing the aperture decreases the ring spacing and improves the 
resolution of the diffraction pattern. However, the central order becomes 
very intense and obscures low frequency components even though a Fourier 
plane mask is being employed. In addition, if the aperture is too large, 
the image will include more than one type of terrain. 

The aperture size was adjusted so that low frequencies are not ob- 
scured by the central order while the ring structure is not so coarse 
that frequency spots due to the image are lost by excessive broadening. 

A sinusoidal frequency v present in the image will produce a spot located 
at a distance y from the center of the diffraction pattern such that y = 
f Av, where f is the focal length of the lens and A is the wavelength of 
light. The aperture was set at 2 millimeters and it produced ring spacing 
of 0.471 millimeter, which corresponds to a frequency of 0.61 cycle per 
millimeter in the image. 

The important spectral content of the diffraction patterns exists 
for frequencies larger than 1 cycle per millimeter. This frequency cor- 
responds to about 1/3 cycle per millimeter for the original image size 
(9-1/2-inch format) or 1 cycle per 3 kilometers on the ground. 
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Diffraction pattern without mask. 217-1 

Fig. 4-2 — Phoenix, Arizona — two diffraction patterns from same area: one 
with a spatial filter, one without 
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4.1.2 Description of EETS-1 Diffraction Patterns 


Table 4-1 lists the ERTS-1 images that were employed for develop- 
ing spatial signatures for various types of terrain. These images are 
shown in Figures 4-3, 4-5, 4-7, 4-9, 4-11, 4-13, and 4-15. Diffraction 
patterns were obtained from the image areas identified by numbered cir- 
cles. The corresponding diffraction patterns are shown in Figures 4-4, 
4-6, 4—8, 4—10, 4-12, 4—14, and 4—16, respectively. The diffraction 
patterns were photographed by using the Fourier plane mask discussed 
in section 4.1.1. 


Table 4-1. List of ERTS-1 Images 



Image Identification 

Area Covered 

4-3 

030 

NASA 

ERTS-E-1070-17495-5 

Imperial Valley, California 

4-5 

084 

NASA 

ERTS E-1070-16073-5 

New Orleans Area 

4-7 

098 

NASA 

ERTS E-1041-18253-5 

Cascade Mountains, Washington 

4-9 

101 

NASA 

ERTS E-1015 -17415-7 

Salt Lake City Area 

4-11 

116 

NASA 

ERTS E-104 0-18201-5 

Cascade Mountains, Washington 

4-13 

181 

NASA 

ERTS E-1031 -17325-5 

Phoenix Area 

4-15 

217 

NASA 

ERTS E-1031-17 325-7 

Phoenix Area 


4.1.3 Cultivated Land Signatures 

In Figure 4-4, diffraction pattern no. 030-3 from a portion of the 
Imperial Valley displays a unique signature for the cultivated land of 
this region. The signature consists of two orthogonal rows of frequency 
spots. One of the rows is slightly titled to the horizontal. A third 
row, which is horizontal, is barely visible and is a result of the line 
structure of the image. The two orthogonal rows are due to the square 
fields, whose size and dimensions are highly repeatable. The frequency 
spots are multiples of the fundamental frequency. 

That so many high frequency spots are visible is due to the Fourier 
plane mask. In Figure 4-6, there are two diffraction patterns, nos. 084-1 
and 084-3, from cultivated land along the Mississippi River. The fields 
are elongated rectangles, with their long dimension approximately normal 
to the river. In diffraction pattern no. 084-3, there are two orthogonal 
rows of spots but the spacing of the spots in the rows is different because 
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Fig. 4-3 — Salton Sea, California — circled areas show scenes from which 
diffraction patterns were produced 
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030-1 Mountainous 030-2 Urban 



030-3 Cultivated 

f Spots due to fields 
0 Spots due to scan lines 


Fig. 4-4 — Salton Sea, California — diffraction patterns from circled areas of 
Fig. 4-3 
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084-1 Cultivated 084-2 Urban 



084-3 Cultivated 084-4 Urban 


Fig. 4-6 — Diffraction patterns from circled areas of Fig. 4-5 
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of the elongated rectangular shape of the farms. In diffraction pattern 
no. 084-1, there is only one row of spots. Orthogonal to it, there is an 
almost continuous linear structure instead of another row of spots. In 
Figure 4-5, the farms appear to have the same shape as the farms that pro- 
duced diffraction pattern no. 084-3. However, crops in adjacent farms 
appear to be similar (in terms of reflectance) and the result is that the 
farms appear to be wider and of irregular shape. This comparison brings 
out a crucial point: the diffraction pattern is affected not only by the 
field size and shape but also by the crop reflectances. 

In Figure 4-11, there are two encircled areas that contain circular 
fields. In Figure 4-12, diffraction pattern no, 116-1 has a structure 
that corresponds to the farm pattern. There is a broken circular ring 
that resulted from the interference of the circular aperture with the 
numerous circular fields. Then, there are at least two frequency spots 
at about double the frequency of the circular ring. These appear to be- 
long to two orthogonal and equally spaced rows of frequency spots, which 
are due to the uniform spacing of circular fields in rows and columns. 

Diffraction pattern no. 116-2 displays a complex pattern. There ap- 
pears to be a near vertical row of frequency spots and a much fainter 
horizontal row normal to the vertical one. Also, there are two orthog- 
onal fans (light diffracted into broad lines) and a smaller fan located 
about 45 degrees to the other two fans. The orthogonal rows appear to 
be related to the regular spacing of the circular farms. Displacement 
of frequency spots in the row is slightly smaller than the displacement 
of the frequency spots for diffraction pattern no. 116-1. This agrees 
with the observation that the farms for diffraction pattern no. 116-2 
appear larger than the farms for diffraction pattern no. 116-1. The 
origin of the fans is not obvious, but they may be related to the vari- 
ation of reflectance in the farms (white, gray, and black) . 

There are two cultivated areas in Figure 4-13, with diffraction 
patterns no. 181-2 and 181-3 shown in Figure 4-14. Both patterns have 
two orthogonal rows of frequency spots, which are, however, very weak. 

In addition, there is a lot of light diffracted in other directions. 

The farms in Figure 4-13 are characterized by many vertical and horizon- 
tal edges. However, due to the reflectance of the crops, the edges ap- 
pear to be random in size and distribution. In turn, the diffraction 
patterns are characterized by many frequency spots. However, the prin- 
cipal spots marked in the figure do correspond to the predominant farm 
size in that region, which is a quarter-mile- square section. 

In conclusion, it appears that cultivated land can be identified 
by a signature, in the diffraction patterns, that consists of two orthog- 
onal rows of frequency spots. The spacing of the frequency spots may not 
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Fig. 4-7 — Cascade Mountains, Washington — ERTS image 1041-18253-5 
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098-1 Mountainous 098-2 Mountainous 



098-3 Snow-covered mountains 

Fig. 4-8 — Diffraction patterns from circled areas of Fig. 4-7 
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Fig. 4-9 — Great Salt Lake, Utah — ERTS image 1015-17415-7 
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101-1 Urban 101-2 Urban 



101-3 Mountainous 


Fig. 4-10 — Diffraction patterns from circled areas of Fig. 4-9 
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116-3 Mountainous 116-4 Mountainous 

Fig. 4-12 — Diffraction patterns from circled areas of Fig. 4-11 
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Fie. 4-13 — Phoenix, Arizona — ERTS image 1031-17325-5 
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181-1 Mountainous 181-2 Cultivated 



181-3 Cultivated 181-4 Urban 


Fig. 4-14 — Diffraction patterns from circled areas of Fig. 4-13 
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be the same in both rows. For some images, the signature must be de- 
tected in a diffraction pattern that contains many other strong frequency 
components. 

4.1.4 Mountainous Terrain Signatures 


In Figure 4-4 diffraction pattern no. 030-1 shows diffraction of 
light (fans) in a direction normal to the mountain ridges (see Figure 
4-3). The fans are rather broad and contain strong frequency components. 

In Figure 4-12, there are three diffraction patterns from mountain- 
ous terrain in the Cascade Mountains. These diffraction patterns have 
characteristics similar to diffraction pattern no. 030-1. 

Diffraction pattern no. 101-3 in Figure 4-10 was produced by the 
mountainous terrain of Figure 4-9. This pattern is also characterized 
by a broad fan in a direction normal to the mountain ridges. In Figure 
4-12, diffraction patterns nos. 116-3 and 116-4 are from the mountainous 
terrain of Figure 4-11. Diffraction pattern no. 116-4 is again similar 
to the other patterns of mountainous terrain. Pattern no. 116-3 displays 
an extremely broadened fan, which can be attributed to variations in the 
orientation of the corresponding mountain ridges in Figure 4-11. 

In Figure 4-14, diffraction pattern no. 181-1 was produced by moun- 
tainous terrain from Figure 4—13 . This pattern also shows a broad fan 
normal to the mountain ridges. In conclusion, the diffraction patterns 
of mountainous terrain show diffraction of light along broad fans normal 
to mountain ridges. An obvious, unique signature for mountains has not 
been identified. 

4.1.5 Urban Area Signatures 


In Figure 4-4, diffraction pattern no. 030-2 was produced by a small 
town in the Imperial Valley (see Figure 4-3) . There is nothing distinc- 
tive about this pattern, and diffraction of light along fans appear to be 
due to the edges of the cultivated region surrounding the town. 

In Figure 4-10, diffraction patterns no. 101-1 and 101-2 are from 
the urban area of Salt Lake City. Several major arteries are seen in 
the encircled areas in Figure 4-11. The diffraction patterns show sev- 
eral rows of frequency spots. The horizontal row is due to the line 
structure of the image (a scanner/ recorder artifact) . Tilted to that 
row is a pair of orthogonal rows due to the repetitive pattern of orthog- 
onal major arteries of Salt Lake City. 

In Figure 4-14, diffraction pattern no. 181-4 is produced by an area 
of Phoenix from the image of Figure 4-13. This pattern shows only one 
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horizontal row of frequency spots owing to the line structure of the im- 
age. 


In Figure 4-16/ diffraction pattern no. 217-1 shows two orthogonal 
rows of frequency spots owing to the major street pattern of Phoenix 
(see Figure 4-15) . In conclusion, urban areas can be identified by sig- 
natures resulting from patterns of parallel major arteries. This signa- 
ture becomes evident only in the IR-2 (0.8- to 1 .1 -micrometer) band for the 
cities of Phoenix and Salt Lake. The difference in signature from the 
same area photographed in two different spectral bands is seen by com- 
paring the relative strengths of the diffracted spots in Figures 4-14 
(no. 181-4) and 4-16 (no. 217-1) . In New Orleans, such a street pat- 
tern does not exist. Instead, New Orleans is characterized by inter- 
sections of major highways that axe visible in the red band image. It 
appears that a signature for small towns does not exist since secondary 
street patterns axe not resolved. 

4.1.6 Other Terrain Signatures 


Diffraction patterns were obtained also from other terrain features 
such as broken clouds, rivers, and transportation networks. These have 
not been presented because they are not expected to produce signatures. 

4.1.7 Conclusions 


The search for terrain signatures has produced the following results: 

a. The diffraction patterns of mountains show clusters of intensity 
at low spatial frequencies and diffracted light spreading in 
wedge-shaped patterns (fans) away from the center. 

b. For cultivated land, a signature consisting of orthogonal rows 
of frequency spots has been identified. For some images, the 
signature may be weak in relation to other components of the 
diffraction pattern. 

c. For urban areas, a signature consisting of orthogonal rows of 
frequency spots has been identified. This signature becomes 
evident only in the IR-2 image and cannot be confused with the 
signature for cultivated land, which is detected in the red band 
(0.6 to 0.7 micrometer). In the red band, a signature for some 
urban areas can be obtained with a high degree of probability 
from intersecting major highways. 

d. Bodies of water have diffraction patterns with very low energy 
at frequencies other than the central lobe. Water can be iden- 
tified by low reflectances in nearly all spectral bands, and. 
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Fig. 4-15 — Phoenix, Arizona — ERTS image 1031-17325-7 
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217-1 Urban 


Fig. 4-16 — Diffraction pattern from circled area of Fig. 4-15 



therefore, the use of a spatial signature is usually redundant. 

e. Deserts also have diffraction patterns with low energy levels 
of light diffracted at most frequencies other than the central 
lobe and very low frequencies. 

f. Rivers and transportation networds are linear features that can 
be recognized by using algorithms operating directly on the im- 
age and. testing for topographic characteristics. Although linear 
features have Fourier transforms that display high spatial fre- 
quency content in a direction perpendicular to the length of the 
feature (i.e., normal to the river bands), there is no indication 
that the Fourier transform provides better signature discrimina- 
tion that the image itself. 

In summation, the terrain features have been divided into three cate- 
gories: 

a. Features for which spatial signatures are not appropriate - des- 
erts and bodies of water. 

b. Features that can best be identified by spatial signatures de- 
veloped directly from an image - broken clouds, rivers, and 
transportation networks. 

c. Features that can be identified by spatial signatures isolated 
in the diffraction patterns or the Fourier transforms - culti- 
vated land, some urban areas, mountains, and hills. 

4.2 Digital Signatures 

The diffraction pattern annalysis of the previous section suggests 
that signatures for the various terrain types can be found in the digital 
Fourier transforms of cells. 

Figure 4-17 shows the amplitude of the Fourier transforms of four 
cells. The horizontal direction coincides with the scanning direction 
of the multispectral scanner. The frequencies represented by the squares 
vary from 0.39 to 5.9 cycles per kilometer. Due to the ground resolved 
distance of the MSS data, the street pattern of Phoenix is not resolved 
in the red band image. Also, the boundaries between adjacent farm plots 
are not resolved. The farm pattern within a cell seems to be established 
by crops of equal brightness. If such crops exist in adjacent plots, they 
have the appearance of larger farms of irregular shape. Also, the farm 
pattern is affected by clouds and highways. Comparison of the digital 
Fourier transforms to the diffraction patterns of the previous section shows 
many similarities as expected. 
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Fig. 4-17 — Digital Fourier transforms of cells 


35 







The Fourier transforms were extensively analyzed to determine what 
portions of the transforms contain information that can be used to rec- 
ognize terrain types. It is desirable that areas of the transforms not 
contributing to the recognition of terrain be excluded from the spatial 
feature measurements. Furthermore, a minimum number of spatial features 
should be extracted. 

The analysis of the Fourier transforms has produced the following 
results: 

a. It appears that frequencies larger than 3.5 cycles/km contain 
the information needed to discriminate between the terrain 
types. Frequencies less than 3.5 cycles/km carry significant- 
ly less information. 

b. Regardless of terrain type, there is significant energy along 

the vertical frequency column (f = 0 ,f ) where x is the hori- 

x y 

zontal direction and y the vertical. In addition, this column 
has local peaks at the frequencies f = 2 . 1 , 4.2 and 6.3 cy- 
cles/km. These appear to be multiples of 1/6 the scanning rate 
(12.6 lines/km) of the Multispectral Scanner. Since the scan- 
ner has six detectors per spectral band, it is conceivable that 
these frequencies are artifacts due possibly to small errors in 
detector response remaining after calibration. The calibration 
errors may not be large enough to produce line structure in an 
MSS image. In any case, the frequency column mentioned contains 
frequency components of the same order of magnitude as the com- 
ponents which have been related to farms. The discrimination 
results between the various terrain types were improved when this 
column was replaced by zeros. 

Measurements of four spatial features are made as follows: 

a. All frequencies less than 3.5 cycles/km and larger than 5.9 cy- 
cles/km are eliminated. 

b. In the remaining Fourier transform, the largest peak (or maxi- 
mum) is determined. 

c. The energy* in a sector which is tt/8 radians wide and centered 
on the largest peak is determined. The energy within this sec- 
tor (Si) is one of the features. 

d. The energies in similar size sectors which are displaced from 
the first one by tt/4, tt/ 2 and 3u/4 radians in a clockwise dir- 
ection are also determined and constitute the features S £, S 3 
and S 4 , respectively. (See Figure 4-18) 

^Energy is defined as the sum of the amplitudes of the Fourier transform 
pixels. 


36 




3 



Other spatial features that have been explored are: a number repre- 

senting a count of high derivative values within the cell and the numbers 
of pixels with values less than 5 and above 55. These features are 
rather insignificant. 

Upon completion of the spatial feature selection, the classification 
of the cells can then be carried out using only the feature vectors. The 
classification involves assignment of the cells to the terrain types by 
partitioning the feature space. For this investigation, the division of 
the feature space was done with a heuristic algorithm, in order to ana- 
lyze the usefulness of the features that were selected. This heuristic 
algorithm consists of logic statements containing many thresholds. (See 
section 5.1.). 

4.3 Spatial Feature Measurements 

The generation of the feature vectors is a time consuming data re- 
duction operation. On a cell basis, the 1,024 pixels are reduced to 
seven numbers*. The reduction in data is about 439 to 1 when the spectral 
data (MSS 4, 5 and 7) is taken into account. For each cell, the input 
data consists of 3 x 1,024 = 3,072 numbers. The time required to compute 
the seven feature vector components is about 1.3 seconds in the IBM 370/ 
158 computer. The time required to compute the vectors for an entire 
ERTS image is over two hours. It is desirable that the feature measure- 
ments be executed at a substantially faster rate. 

Also, it is desirable to reduce the size of the cell to less than 
2.5 km square in order to reduce the probability that a cell will con- 
tain more than one type of terrain. Reducing the cell size digitally 
to less than 32 x 32 pixels makes the Fourier transform grainy and in- 
troduces considerable frequency quantization noise to the spatial fea- 
tures S;l, S 2 , S 3 , S 4 . To reduce this quantization noise, it will then 
be necessary to interpolate and resample the Fourier transforms. For 
example, if the digital cell size is reduced to 16 x 16 pixels, the 
Fourier tranforms will also contain 16 x 16 samples. These would have 
to be interpolated and resampled so that they contain at least 32 x 32 
samples. The interpolation and resampling operations will consume more 
computer time. 

The spatial features can be measured more rapidly and efficiently 
by measuring the energy in specified portions of the diffraction patterns 
of cells. In other words, the spatial features of cells can be measured 
electro-optically with photodetectors after the diffraction patterns of 
the cells have been formed optically as described in section 4.1. The 
detector outputs can be digitized and then introduced into a computer as 
feature vectors. 

*The seven features are obtained from the MSS4, 5 and 7 spectral bands. 
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An added advantage of the electro-optic measurement of features is 
that the cell size can be reduced without introducing frequency quanti- 
zation noise as occurs with the digital Fourier transforms. 

However, the main advantage of the electro-optic measurement of the 
features is the speed with which the feature vectors can be measured. 

If one assumes that the feature vector for a cell is measured conserva- 
tively at the rate of one vector per millisecond, the speed advantage 
is 1,300:1 in comparison to the computer. If the cell size is reduced 
to an equivalent of 16 x 16 pixels, the feature vectors of an entire 
ERTS image can be measured at about 21 seconds. 

5. TERRAIN TYPE RECOGNITION 

5.1 Heuristic Recognition Algorithm 

This algorithm was developed by examining the spatial features of 
cells of the same class. Then, the vector space was divided by thresh- 
olds forming hyperplanes. The algorithm was then employed to classify 
the cells from a portion of ERTS-1 image no. 1049-17324-5 containing 
Phoenix, Arizona and the surrounding area in southern Arizona (about 
8,000 square km). The thresholds in the algorithm were then adjusted 
until the recognition results were optimized^. 

In order to determine the accuracy of the heuristic algorithm, two 
assignment matrices of cells were developed by photointerpretation. One 
matrix represents assignments using only the ERTS-1 red band image 1049- 
17324-5. The other matrix represents assignments using all available 
information (maps, aerial photography, and a land use map*). Comparing 
these two matrices, one finds that the first one has about 2.5% errors, 
which indicates the error rate of a photointerpreter if he were to use 
only the red band ERTS-1 image. Most of the errors are due to isolated 
urban areas near cultivated land, mountains, or desert. 

The computer classification matrix was compared to the most accurate 
photointerpreter matrix. The results are tabulated in Table 5-1 and 
demonstrated that a four-dimensional vector employing only the spatial 
features of Section 4.2 is sufficient to recognize major terrain types. 


^Gramenopoulos , N., "Terrain Type Recognition using ERTS-1 MSS image 
NASA SP-327, Symposium on Significant Results Obtained from the Earth Re- 
sources Technology Satellite-1, March 5-9, 1973. 

*Poulton, C.E., Schrumpf, B. J. , Johnson, J. R. , "Ecological Resourc 
Analysis from High-Flight Photography for Land Use Planning", Applied Re- 
mote Sensing of Earth Resources in Arizona, Proceedings 2nd ARETS Symposi 
University of Arizona, November 2-4, 1971. 
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TABLE 5*1 COMPARISON OF CLASSIFICATION RESULTS* 


PHOTOINTERPRETER’S ASSIGNMENT 


CLASS 

1 

2 

3 

4 

5 

6 

7 

8 

TOTAL 

0 

0 

0 

3 

15 

22 

6 

14 

0 

60 

1 

61 

0 

0 

0 

0 

0 

0 

1 

62 

& 2 
s 

0 

2 

0 

1 


0 

0 

1 

4 

i 

1 

0 

417 

6 

7 

8 

5 

0 

444 

H 

cn 4 

< 

0 

0 

3 

248 

1 


2 

D 

264 

Ph c 

W D 

Eh 

0 

0 

5 

3 

131 


4 

0 

145 

D 

I 

0 

0 

1 

5 

3 

71 

8 

0 

88 

U 

7 

0 

0 

1 

0 

0 

0 

4 

1 

5 

8 

0 

1 0 

0 

0 

0 

0 

0 

13 

13 

TOTAL 

62 

1 

: 2 

i 

i 

430 

278 

164 

96 

37 

16 

1085 


Class Identification 

0 = Unidentified 4 = Farms 

1 - Clouds 5 = Mountains 

2 = Water 6 = Urban 

3 = Desert 7 - Riverbed 

6 = Cloud Shadows 

♦Heuristic algorithm using spatial features only. 


























































Many cells contain two terrain types, and the computer assignment of 
a cell is deemed correct if one of the terrain types is correctly recog- 
nized. Clouds and cloud shadows can confuse the classification of under- 
lying terrain* Therefore, cells that contain even a small cloud or a 
cloud shadow are not assigned to a terrain type* The underlying terrain 
can be classified from different ERTS-1 images when it is cloud free* 

Whenever cells cannot be classified with reasonable confidence, they 
are assigned to an unidentified group. This can be justified by the fact 
that all the information available (the other three spectral images) is 
not being utilized yet, and it is desirable to defer making decisions on 
questionable cells until the spectral information is introduced. Due to 
scale and resolution, only three types of natural terrain can be recog- 
nized: desert, mountains, and riverbeds or flood plains. 

Due to the population density of the area and man's activities, the 
signatures of the various terrain types vary considerably, thus increas- 
ing the probability of error. The following examples have been noted; 

a. Highways and canals cutting through desert can be confused with 
flood plains. 

b. Urban development has extended into the hills north of Phoenix 
and new developments farther north into the desert have been 
initiated. These areas are now urban, but retain some of their 
former desert or mountainous characteristics, thus producing 
some classification errors. 

(See Figures 5-1 and 5-2.) 

c. The urban areas are spreading into farmland near Glendale and 
farms have been noted surrounded by urban areas or in the pro- 
cess of being cover ted to new developments. Such areas have 
produced classification errors or unidentified cells* Sun 
City, a circular urban development near Glendale, is quite 
different in appearance than downtown Phoenix. 

d. The Salt River bed runs through Phoenix. The waters of this 
river have been diverted for irrigation, and the river bed is 
now dry. The riverbed is used for urban or industrial pur- 
poses, but it is thinly populated in comparison to Phoenix. 
Classification errors and unidentified cells were found along 
the river bed. 

It was also noted that farms are a dominant terrain type. Most 
cells containing two terrain types with farms being one of them were 
assigned to the farm category. 
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Fig. 5-2 — Low altitude photograph of Phoenix, Arizona suburbs — taken by 
NASA earth resources aircraft 
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The classification results are shown in the annotated photograph 
of Figure 5-3. The letters were superimposed on the MSS-5 image and 
are centered within each cell. The interpretation of the symbols is: 


D = Desert 
U = Urban 
F = Farms 
M = Mountains 


R = Riverbeds 
C = Clouds 
S = Cloud Shadows 
Blank = Unidentified 


The recognition results for most terrain types were very good: 95% 

for desert, 89% for farms, 80% for mountains, 74% for urban areas, 98% 
for clouds, 100% for water, 81% for cloud shadows. Only river flood 
plains, which are peculiar geographic features of southern Arizona, were 
recognized poorly (11%). 

Due to the resolution of the system, urban areas cannot be recognized 
with a very high degree of confidence until the information in the other 
spectral bands has been exploited. Most of the errors in the urban 
category seem to occur in thinly populated areas associated with recent 
urban expansion into desert, farms, riverbeds, and hills. 

From the recognition results, it appears that the spatial features 
selected are sufficient for the purposes of classifying terrain types. The 
four spatial features are derived from the Fourier transforms of the 
cells. (See section 4.2.) 

5.2 Maximum Likelihood Criterion 


5.2.1 Integration of Multispectral and Spatial Information 

Integration of spectral and spatial features is required in order to 
exploit the information available in the ERTS-1 imagery. It is also de- 
sirable to employ a general recognition algorithm which is independent of 
the data characteristics. The maximum likelihood criterion* which has 
produced good results with multispectral data has been selected. This 
criterion requires training data and is optimum only when the terrain 
classes form multivariate Gaussian distributions in feature space. In 
addition, no class is allowed to have a feature vector component which 
is always zero. 

The spectral features are the average cell brightnesses in the 
MSS 4, 5 and 7 spectral bands. 


*Marill, T. , Green, D. M. , "Statistical Recognition Functions and 
the Design of Pattern Recognizers" , IRE Transactions of Electronic Com- 
puters, December 1960, p. 472. 
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Fig. 5-3 — ERTS-1 image 1049-17324-5 and terrain classification results 
from heuristic algorithm 
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The four spatial features described above were combined with three 
spectral features to form a seven-dimensional vector describing each cell. 
Using image 1049-17324 as training data, the statistics (covariance matri- 
ces) for desert, farm, mountains and urban areas were computed. Then, 
using the statistics, the same input data and the maximum likelihood cri- 
terion, the cells were reclassified in one of the four categories. The 
recognition accuracy was 54% for desert, 92% for farms, 95% for mountains, 
and 92% for cities. Desert was poorly recognized (many desert cells were 
assigned to the mountain category), and it was suspected that the statis- 
tics for the classes were not Gaussian. 

5.2.2 Gaussian Statistics of Class Vectors 


Analysis of histograms of the classes showed that no component of 
any class was Gaussian. In particular, the distribution for the spatial 
features appeared like the Rayleigh rather than the Gaussian. All com- 
ponents were positive with small means and large standard deviations. 

See Figures 5-4 and 5-5.) 

5.2.3 Non-Linear Transformations of Class Vectors 


Non-linear transformations applied to the feature space change the 
statistics of the classes. A different transformation is required for 
each feature component. The transformations must be such that large 
component values are reduced while small values remain unchanged. (See 
Figure 5-6. ) Known transformations of this type are the logarithms on 
various bases such as the natural logarithm or powers less then unity 
such as the square or third roots. If a logarithmic transformation is 
selected, the base of the logarithm is adjusted for each feature com- 
ponent so that the class distributions in this component become 
approximately Gaussian. Similarly, if a power transformation is 
employed, the power is adjusted for each component so that the class 
distributions in this component become approximately Gaussian. 

For example, if the power is 0.5 for all components, then each compo- 
nent of every feature vector is replaced by its square root. 

Both logarithmic and power transformations were extensively ana- 
lyzed. The logarithmic transformations gave good results, but the re- 
sultant distributions were sensitive to amplitude variations in the 
data. In other words, if the logarithm on a certain base is used for 
one component (for example, the MSS 7 band) the distribution for each 
class may or may not approximate the Gaussian depending on solar illu- 
mination which changes the range of values obtained in this band. 

Finally, the non-linear transformations selected are all powers 
less than 1. For image 1049-17324, the powers for the various 
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-3ff -2a -la M + la +2a +3a +4 a -3a -2a -la M +la +2a +3a + 4a 

(a) Desert, Yj spatial component* (b) Desert, MSS 7 



-3a -2a -la M + la +2a +3a + 4a -3a -2a -la M +la +2a +3a +4a 


(c) Desert, MSS 5 (d) Desert, Y 2 spatial componentt 

Fig. 5-4 — Histograms of desert distributions in four components (in units of 
standard deviations about the means) 

*Y t = Sj + S 2 + S 3 + S4 
tY 2 = S t • S 2 • S 3 • S 4 
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(a) Farms, derivative count 


(b) Desert, derivative count 



(c) Mountains, derivative count (d) Cities, derivative count* 

Fig. 5-5 — Histograms of terrain classes in one component (in units of 
standard deviations about the means) 


* Derivative count = number of cell derivative pixels exceeding in amplitude a threshold 
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components range from 0.8 to 0.025 and have been optimized to produce 
excellent recognition results in all four classes: desert 89%, farms 
97%, mountains, 96%, and city 95%. In arriving at an optimum trans- 
formation for each component, Gaussian distributions cannot be achieved 
for all classes. In other words, as one adjusts the transformation of 
say the MSS 5 band, the desert distribution may become more Gaussian 
while the city distribution may become less Gaussian. There is no rea- 
son to believe that all distributions can always be made Gaussian in 
all components by appropriate transformations. To reduce the possi- 
bility of such problems and maintain high recognition rates, the number 
of different classes should be kept small. (See Figures 5-7 and 5-8.) 

Also, the importance of each component in the recognition of each 
class is not the same. A component may be essential for recognizing 
one class and relatively unimportant for recognizing other classes. 
Knowledge of this can be taken advantage of when selecting a transfor- 
mation for a component. The transformation is adjusted so that a sym- 
metrical distribution is obtained for the class that the component is 
most important. The effectiveness of each transformation can be judged 
from the recognition rates achieved. 

5.2.4 Recognition Results 

An important question that arises is the variability of the terrain 
class statistics with time of year and geographic location. If the sta- 
tistics vary substantially, one must have statistics for each image to 
be processed. On the other hand, it would be desirable to have one 
set of statistics stored that would be applicable to a large geographic 
area for imagery acquired at any time of the year. 

To explore this possibility, data from ERTS-1 images 1031-17325 and 
1103-17332 were subjected to terrain classification using the statistics 
developed from training data of ERTS-1 image 1049-17324. The data from 
all three images cover the same general area around Phoenix, Arizona. 

The dates of acquisition are August 23, 1972, November 3, 1972, and Sep- 
tember 10, 1972, respectively. 

The data of image 1031-17325 was first subjected to the non-linear 
transformations described in the previous section and was then classified 
using the statistics from image 1049-17324. The recognition results are 
shown in Table 5-2. The detection rates for the four classes were: 93% 
for desert, 93% for mountains, 100% for farms, and 85% for urban areas. 
The accuracy of recognition of urban areas was not above 90% because many 
urban cells containing parks or golf courses were assigned to the farm 
category. 
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-3a -2a -la M +la +2a +3a +4a -3a -2a -la M +la 42a 43a 44a 


(a) Desert, MSS 5' 


(b) Desert, Y[ 



-3a -2a -la M 4la 42a 43a 44a -3a -2a -la M 4la 42a 43a 44a 


(c) Desert, (d) Desert, MSS V 

Fig. 5-7 — Histograms of desert distributions after nonlinear transformations, 
compare to Fig. 5-4 
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(a) Cities, derivative count' 


h 


-3a 



-2a -la M +la +2a +3a +4 a 

(b) Mountains, derivative count' 




(c) Desert, derivative count' (d) Farms, derivative count' 


Fig. 5-8 — Histograms of class distributions after nonlinear transformations — 
compare to Fig. 5-5 
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Table 5-2. Comparison of Classification Results - 
ERTS-1 Image 1031-17325 


Photo interpreter 


CLASS 

1 

2 

3 

4 

5 

1 

429 

0 

6 

0 

6 

M 2 
U) 

1 

407 

0 

10 

7 

t 3 

31 

0 

114 ! 

0 

1 

U 4 

1 

1 

3 

57 

11 

TOTAL 

462 

408 

123 

67 

25 








Table 5-3. Comparison of Classification Result - 
ERTS-1 Image 1103-17332 


Photointerpreter 


CLASS 

1 

2 

3 

4 

5 * 

1 

448 

1 

14 

13 

2 

M 2 

<n 

17 

443 

7 

6 

31 

S 3 

g* 

6 

0 

74 

2 

1 

0 

o 4 

0 

0 

0 

20 

0 

TOTAL 

471 

444 

95 

41 

34 








*The class statistics were derived from training cells 
of image 1049-17324 of September 10, 1972. This is in 
, _ the dry season when riverbeds turn into desert. Hence, 

^ no riverbed class statistics had been developed. 

2 - Farms r 

3 = Mountains 

4 = Urban 

5 = Riverbeds 
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For image 1103-17332, it was noted that the average brightness in 
all bands was substantially reduced due to lower solar elevations during 
the month of November. It was felt that the data should be compensated 
for the change in solar elevation. The average feature space vectors 
were determined for images 1049-17324 and 1103-17332. Then, the vectors 
of image 1103-17332 were linearly scaled so that the average vectors for 
both images would be the same. This compensation assumes that the change 
in statistics between September and November is simply a change in scale 
of the feature space. The compensation applied was only an approxima- 
tion. Then, the compensated data was subjected to the non-linear trans- 
formations described in the previous section and was afterwards classi- 
fied using the statistics from image 1049-17324. The recognition results 
are shown in Table 5-3. The detection rates were as follows: 95% for 

desert, 100% for farms, 78% for mountains, amd 49% for urban areas. It 
is obvious that the statistics employed were not optimum for image 1103- 
17332 despite the simple compensation applied to the feature space. Some 
mountain and urban areas were assigned to the desert and farm categories. 
Figure 5-9 shows the ERTS image number 1103-17332 with the recognition 
results superimposed as annotation. 

Table 5-4 summarizes the recognition results for the maximum likeli- 
hood criterion and the heuristic algorithm. This table provides a com- 
parison of detection rates for various sets of data and features being 
employed: 


a. Data from ERTS-1 image 1049-17324 was used. The feature vectors 
were three-dimensional, and the features were the average cell 
brightness in the MSS 4, 5 and 7 spectral bands. The same vec- 
tors were used as training data to compute the class statistics 
and were then classified by the maximum likelihood criterion. 

The recognition results are what one could expect when using 
the maximum likelihood criterion in multispectral recognition. 

b. Data from ERTS-1 image 1049-17324 was used. The feature vectors 
were four-dimensional with spatial features only. All other 
conditions were as in case a, above. 

c. The spatial and spectral features of case b and a, respectively, 
were combined to form seven-dimensional vectors. All other con- 
ditions were as in case a, above. 

d. The vectors of case c were non-linearly transformed as described 
in section 5.2.3. Then, the class statistics were computed from 
them, and they were subsequently classified by the maximum like- 
lihood criterion. 
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Fig. 5-9 — ERTS-1 image 1103-17332 and terrain classification results 
(maximum likelihood criterion, combined spectral and spatial features) 
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Table 5-4. Recognition Results for the Maximum Likelihood 
Criterion and the Heuristic Algorithm 
Detection Rates, Percent 


Processing Conditions 


1049-17324: 

3 spectral features only 


1049-17324: 

4 spatial features only 


1049-17324: 

Combined 7 -dimensional 
feature vectors 


1049-17324: 

Transformed 7-dimensional 
feature vectors 


1031-17325: 

Transformed, 1049 
statistics 


1103-17332: 

Compensated to solar 
elevation and transformed 
1049 statistics 


1049-17324: 

Spatial features only, 
heuristic algorithm 


Desert Farms Mountains Urban Areas 

87 93 96 86 

8 83 97 70 

54 92 97 92 

89 97 96 95 

93 100 93 85 

95 100 78 49 

97 89 80 74 
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e. The data was seven-dimensional vectors from EETS-1 image 1031- 
17325. The vectors had been non-linearly transformed as in case 
d. The class statistics employed were from case d. (ER.TS-1 im- 
age 1049-17324) 

f. The data was seven-dimensional vectors from ERTS-1 image 1103- 
17332. The vectors were scaled so that the average vectors for 
images 1049-17324 and 1103-17332 were equal. This was an ap- 
proximate scheme for compensating the lower solar elevation for 
image 1103. All other processing operations were the same as 
in case e. 

g. The data was from ERTS-1 image 1049-17324 processed using only 
spatial features and the heuristic algorithm. (See section 5.1.) 

Case b shows that the maximum likelihood criterion gives poor re- 
sults with spatial features for the desert class. This has been attri- 
buted to the wide deviation of the spatial features from Gaussian statis- 
tics. (See Figure 5-4.) This conclusion is supported by case g which 
shows very good results when the spatial features are used with the heu- 
ristic algorithm rather than the maximum likelihood criterion. 

Case a shows good results using the spectral features with the max- 
imum likelihood criterion. The reason is that the class distributions 
for these features, though not Gaussian, do not deviate as sharply as 
the class distributions for the spatial features. This is fortuitous 
because there has been very extensive use of the maximum likelihood 
criterion with multispectral data, and we are not aware of any other 
investigations into the nature of the statistics (Gaussian or not) . 

Cases c and d show that by combining the spectral with the spatial 
features, the recognition rates for urban areas have improved substan- 
tially. Comparing case d to cases b and c shows the importance of the 
non-linear transformations in achieving high recognition rates. Also, 
comparing case d to cases a and g further indicates that combining the 
spectral and spatial features improves the recognition rate of the urban 
areas significantly. 

For cases e and f, the recognition rates for desert and farms has 
improved over the corresponding recognition rates of case d. However, 
for mountains and urban areas, the recognition rates have progressively 
deteriorated. It appears, therefore, that the statistics of some clas- 
ses in a given geographic area vary substantially with the seasons. 

From the results of cases a through g, it appears that the spatial 
features have not increased the number of classes that are recognized 
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over what can be achieved with multispectral data only. However, the 
addition of the spatial features has increased the accuracy of recogni- 
tion significantly for the urban classes. 

There seems to be redundancy between the spectral and spatial fea- 
tures, particularly for the desert and farm classes. This, of course, 
is a characteristic of the geographic area and the ground resolved dis- 
tance of the ERTS-1 data. For example, the spectral features are actually 
identifying live, dense vegetation rather than farms. In this part of 
southern Arizona, such vegetation is associated with irrigated farmland. 

In another part of the country, it may be associated with deciduous for- 
ests, grassland, etc. On the other hand, the spatial features are iden- 
tifying farms, whether the farms have live vegetation (growing crops) or 
are fallow. 

Furthermore, the ground resolved distance of the ERTS-1 data is large 
and only large geographic features (bays, lakes, mountains, farmland, 
desert, forests, large urban areas, etc.) can be identified. These tend 
to be highly correlated spectrally and spatially. 

5. 3 Clustering Algorithm 

5.3.1 Background 

Various clustering algorithms have been investigated over the years. One 
of the established algorithms was developed at LARS, Purdue University and is 
part of the LARSYS software system. The algorithm has been described by 
Wacker 5 and Landgrebe and by Swain 6 . 

Conceptually, the clustering algorithms consist of the following parts: 

a. Determination of potential clusters and their statistics. 

b. Assignment of all feature vectors to the potential clusters. 

c. A criterion of separability is applied to the potential clusters 
to determine which clusters are unique. 

It turns out that part n a" is the most crucial and has the greatest 
impact on the success of the algorithm. There seem to be at least two 
approaches in determining potential clusters: 

a. The first approach is to search the vector space for local con- 
centrations of vectors. 

b. The second approach is to make some initial assumptions about the 
number of clusters and their distributions in the feature 
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space. Then, each vector is assigned to a cluster, and the 
statistics of each cluster are computed. Repeating this op- 
eration several times, hopefully leads to convergence to the 
true clusters present in the vector space. This is the ap- 
proach taken in the LARS clustering algorithm. 

The first approach is the direct method of determining the cluster 
centers. However, the digital operation of searching the feature space 
for vector concentrations is very time consuming. When a large number 
of vectors are being clustered, the computer time required can become 
prohibitive. 

The second approach has been employed to cluster large numbers of 
vectors because it produces simple algorithms and reasonable computation 
times. Unfortunately, there is no guarantee that these algorithms con- 
verge on the correct clusters every time. It might be necessary to per- 
form the clustering two or more times with different initial conditions. 
If the same clusters are developed for all initial conditions, then a 
degree of confidence is achieved on the correctness of the results. How 
ever, if the clusters are dependent on the initial conditions, then ambi 
guities may still exist as to which group of clusters is the correct one 

In this investigation, a fast algorithm has been developed which 
determines the centers of clusters without any initial conditions. The 
centers are developed by computing the Euclidean distances between the 
vectors and selecting groups of vectors whose distances between each 
other are small. Instead of searching the seven -dimensional vector 
space for local concentrations of vectors, the algorithm determines 
which of the data vectors are surrounded by a number of closely spaced 
vectors. The computer time consummed by this algorithm is a few minutes 
because in addition to the algorithm characteristics, the number of vec- 
tors being clustered is 1,116. 

The number of classes present in the data and their relative popu- 
lations depend on the geographic area from which the vectors have been 
derived. All clustering algorithms work better when the classes repre- 
sented in the data are equally populated. This, of course, does not 
occur naturally. (See Table 5-1.) However, if one assumes that at 
least 100 vectors per class are required to specify the statistics of 
a class with an acceptable accuracy level and that no more than 20 dif- 
ferent classes exist, then it should not be necessary to cluster more 
than 2,000 vectors. 

This condition can be satisfied since one has control of the number 
of vectors being introduced into the clustering algorithm as well as the 
geographic area they represent. For example, in the Phoenix area, one 
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could limit the number of vectors from the cultivated and desert areas 
while all the vectors from the urban and mountainous regions could be 
utilized. The conclusion is that a general knowledge of the geographic 
area by the analyst can be exploited to reduce the number of vectors 
processed by the clustering algorithm and improve the results of the 
clustering operation. 

After the potential cluster centers have been determined, it is 
then necessary to employ a criterion of separability in order to deter- 
mine which clusters are unique. Some of the potential cluster centers 
determined by the first operation are expected to be parts of larger 
clusters. After the criterion has been applied, some clusters are com- 
bined and others remain intact. 

The criterion of separability that is being employed in this inves- 
tigation is the divergence.* 

Swain 6 has described another criterion which he terms the "Swain- 
Fu distance". This criterion is employed in the LARSYS program and is 
computed from the mean distance between two clusters and their "ellip- 
soids of concentration" . 

In this investigation, the divergence criterion has been examined 
extensively. Using vectors whose classification was already known (from 
ERTS-1 image 1049-17324) the class statistics were first computed, and 
then the divergence between any two classes was computed using the cor- 
responding class statistics. 

If the divergence is greater than 10, then the probability that a 
vector belonging to one of two classes can be correctly identified is 
greater than 80%. The divergence is also dependent on Gaussian statis- 
tics. Before the non-linear transformations of the vectors (when the 
classes were highly non-Gaussian) , the divergence values varied between 
400 and 6 x 1O 0 and seemed to bear no relationship to the recognition 
rates achieved. After the non-linear transformations were completed, 
rendering the classes approximately Gaussian, the divergence values took 
on a more reasonable range (20 - 100) and generally speaking the higher 
divergence values were associated with pairs of classes between which 
very few errors were made. 

The conclusion drawn from this analysis is that the divergence is 
a useful criterion of separability of clusters whenever the clusters 
are approximately Gaussian. 


*Marill, T. and Green, D., On the Effectiveness of Receptors in Rec- 
ognition Systems, IEEE Transactions of Information Theory, Jan. 1963, p. 11. 
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5.3.2 Clustering Algorithm Operation 


Figure 5-10 shows a block diagram of the clustering algorithm that 
has been developed. In the first subroutine (Distance Computation) 
an Euclidean distance is computed for every pair of vectors. About 200 
vectors are input ed and 19,900 distances between them are computed. 

The distances are compared to an initial threshold t^ and those smaller 
than t 0 are stored together with the vectors that produced them. This 
operation should produce a list of 400-500 minimum distances. If fewer 
than 400 distances are found, the threshold t 0 is increased. All the 
distances are computed again, and, those less than the new threshold, 
are stored. Conversely, if more than 500 distances are found, then the 
threshold is reduced, etc. 

In the next subroutine (Selection of Cluster Centers), the input 
data consists of the 400—500 minimum distances and two vector arrays 
containing the vectors associated with, these distances. A search is made 
through these vector arrays, and all vectors (Vj ) which appear more 
than 15 times are identified as potential centers of clusters. For 
each vector Vj , the distances djk from the associated Vk vectors are 
listed and ordered in increasing magnitude. The 16 smallest distances 
are summed and form the sums Sjk* Assuming there were twelve vectors 
V j , there will be twelve sums Sjk* The minimum of these is selected, 
and the associated vector^Vjlm is selected as the center of the 
densest cluster. All vectors associated with CVj'J m , are assigned 
to the first cluster. This operation leads to the formation of the 
core of cluster No. 1 which is also the densest cluster. Note, that 
a specific vector may belong to the Vj category by appearing more than 
15 times and the Vk category by being associated with another vector 
Vj . Therefore, all vectors which are assigned to cluster No. 1 are 
subsequently removed from the Vj and the Vj^ categories. This guarantees 
that subsequent clusters to be formed will not overlap with cluster 
No. 1. Of the original set of twelve Vj vectors, there may remain only 
five vectors after the No. 1 cluster vectors are removed. Seven Vj 
vectors were eliminated either because they were assigned to cluster 
No. 1 or because they lost some of their Vk vectors to cluster No. 1 
so they no longer had more than 15 vectors associated with them. 

The reduced sets of Vj and V k vectors are recycled through the 
same subroutine as if they were the original sets and the No. 2 
cluster core is formed. Then, the Vj and Vk vector sets are reduced 
again and recycled through the same subroutine to form perhaps the core 
of cluster No. 3, etc. When no more Vj vectors are left, the formed 
clusters (three in this case) are tested in the next subroutine 
(Combine Clusters by Divergence) for statistical separability. The 
divergence between every pair of clusters is computed, and a pair of 
clusters is merged if the respective divergence value is less than 10. 
Such clusters tend to be adjacent parts of a larger cluster and should 
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CLASSIFIED VECTORS 
PLUS CLUSTER STATISTICS 


Fig. 5-10 — Clustering algorithm block diagram 
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therefore be united. The clusters that survive this test, (perhaps two 
clusters are left in the example given above) are the cores of the densest 
clusters. By eliminating them, it will then be possible to identify 
the less dense or diffuse clusters* Therefore, the entire sequence 
of processing operations is repeated except that the vectors already 
assigned to clusters are not included in the input vector set* This 
clustering sequence produces a few more clusters, and the vectors 
assigned to those clusters are removed from the input vector set. A 
third recycling through the clustering algorithm usually terminates the 
cluster core formation by depleting the initial set of input vectors. 

Of the original 200 vectors, about 180 are assigned to about 10 elementary 
clusters. These elementary clusters consist of the cores of the dense 
and diffuse clusters as well as regions of the vector space where 
vectors from two or more true clusters are intermixed. These false 
cluster cores are usually formed after the diffuse cluster cores. But, 
one does not know whether the last clusters to be formed are true 
or false. In other words, the clustering program will occasionally 
overproduce cluster cores. 

The false cluster cores are identified with a divergence test. 

The divergences of the last cluster to be formed against the other 
clusters are computed. If one or more divergence values are less than 45, 
the last cluster is dissolved. The same test is applied to the next 
to the last cluster, etc, so that all clusters retained have divergence 
values between each other, greater than 45. The result may be that from 
the 10 elementary clusters only a few (usually 3-8) are retained. 

At this point, the true cluster cores have been identified, but the 
cluster statistics are not known accurately and only about 100 vectors 
have been classified. In the last subroutine (Classify Remaining 
Vectors), the remaining vectors (about 1,000) are assigned to the 
cluster cores by the maximum likelihood criterion. The assignment 
is done iteratively by first assigning vectors only near the cluster 
cores. Then, the cluster statistics are recomputed and additional 
vectors are assigned. In the process, the clusters grow and their 
statistics change. This subroutine allows for seven iterations. By 
the last iteration, most of the vectors have been classified. The 
remaining vectors are left in the unknown category (Class 0) because 
their probabilities of belonging to any cluster are very small. 

5.3.3 Clustering Results 

The clustering algorithm operates on the basis of a number of 
assumptions which have been reduced to a minimum, but cannot be eliminated 
These assumptions are: 

A. An assumption must be made about the cluster statistics. In 
this case, it has been assumed that the cluster statistics 
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Table 5-5 


Comparison of Clustering Results 
ERTS-1 Image 1103-17332 
Phoenix, Arizona 
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Table 5-6. Comparison of Clustering Results - 
ERTS-1 Image 1070-17495 
Imperial Valley, California 


Photointerpreter 



B = Boundaries 
W1 = Clear Water 
W2 = Turbid Water 
Dl = Bright Smooth Desert 
D2 = Bright Rough Desert 
D3 - Dark Desert 
M “ Hills and Mountains 
U = Urban 

FI - Irregular Farms 
F2 = Rectangular Farms 

















Table 5-7. Comparison of Clustering Results 
ERTS-1 Image 1034-17500 
Imperial Valley, California 
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Table 5-8, Comparison of Clustering Results 
ERTS-1 Image 1040-18201 
Cascade Mountains, Washington 


Photointerpreter 


< 

B 

M2 

Fl 

Ml 

F2 Detection Rates, 

0 

45 

29 

23 

13 

° Ml 96 

u i 

Q) - 1 - 

26 

280 

28 

4 

0 M2 88 

3 

% 2 
o — 

0 

1 

215 

0 

0 F1 81 

u 

3 

0 

10 
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B = Boundaries 
Ml - Forested Mountains 

M2 = Riverbeds, Unforested Hills and Mountains 
Fl = Dry Farming 
F2 = Irrigated Farms 
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Table 5-9. Comparison of Clustering Results 
ERTS-1 Image 1077-18260 
Cascade Mountains, Washington 


Photointerpreter 
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B = Boundaries 
FI = Dry Farming 
F2 = Irrigated Farming 
Ml = Forested Mountains 
M2 = Unforested Mountains 
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Table 5-10. Comparison of Clustering Results 
ERTS-1 Image 1015-17415 
Salt Lake/ Utah 


Pho to in t erpr e ter 



Detection Rates, % B = Boundaries 
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W1 = Clear Water 
W2 = Turbid or Shallow Water 
M = Forested Mountains 
U = Urban 

F = Farms (Irrigated) 

D = Desert, Hills, Unforested 
Mountains, Dry Farms. 
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Table 5-11. Comparison of Clustering Results 
ERTS-1 Image 1069-17420 
Salt Lake, Utah 
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are Gaussian, and therefore, non-linear transformations of 
the feature vectors are required. 

B. Initially, the algorithm assumes that cluster centers are 
spherical and applies the Euclidean distance criterion. 

C. The algorithm finds cluster centers on the basis of local 
vector densities. Therefore, it is important that equal 
numbers (approximately) of vectors from each class are being 
clustered. 

The clustering algorithm has been applied to vectors from several 
ERTS-1 images. We have found that the formation of the cluster centers 
is the crucial part of the algorithm and it is affected by the relative 
amplitudes of the vector components (due to the Euclidean distance 
criterion) . It is also very important to introduce about equal numbers 
of vectors for each terrain class. This is being accomplished by 
selecting the vectors which are employed by the algorithms to form the 
cluster centers. A human looks at an ERTS color composite and selects 
equal size (as far as possible) areas that appear homogeneous. The 
vectors from these areas form the subset from which the algorithm 
develops the cluster centers. Consider an image consisting of 70% 
mountains, 20% farmland and 10% urban. If all the vectors are used 
for the cluster center formation, then it is quite likely that the 
urban area would be absorbed in the mountain class. If, however, the 
vectors used for cluster center formation represent 25% mountains, 20% 
farmland and 10% urban areas, then the cluster centers will probably 
be formed correctly. No training data or a priori knowledge of the 
geographical region is required. The human interacts with the algor itlim, 
but does not bias the algorithm’s decisions. 

Tables 5-5 and 5-6 summarize the results of the clustering algorithm 
operating on images 1103-17332 from Phoenix, Arizona and 1070-17495 from 
the Imperial Valley, California. Tor either image, the clusters are 
identified by non-zero numbers in the sequence of their formation. Some 
vectors could not be assigned to any cluster with a high dgree of 
confidence and were left in the zero category. Most of these vectors 
represented cells containing boundaries between terrain classes. The 
detection rates for the various terrain classes are also shown in 
Tables 5-5 and 5-6. For image 1103-17332, the detection rates should 
be compared to the appropriate entry in Table 5-4. 

For image 1070-17495, the clustering algorithm had some surprising 
results. It produced two water classes and three desert classes. 
Photointerpretation of ERTS-1 color composites suggests only one water 
and one desert class. 

Table 5-7 summarizes the results of the clustering algorithm 
operating on image 1034-17500 from the Imperial Valley, California. 
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Fig. 5-11 — ERTS-1 image 1103-17332 with clustering results 
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Fig. 5-12 — ERTS-1 image 1070-17495 with clustering results 
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Fig. 5-14 — ERTS-1 image 1040-18201 with clustering results 
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Fig. 5-15 — ERTS-1 image 1077-18260 with clustering results 
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Tables 5-6 and 5-7 should be compared. The geographical areas are 
practically the same with the 1034-17500 image being shifted by 5.5 
kilometers to the west of the 1070-17495 image. The differences between 
Tables 5-6 and 5-7 reflects primarily the differences in the MSS data 
from the two acquisition dates. 

Tables 5-8 and 5—9 summarize the results of the clustering algorithm 
operating on data from images 1040-18201 and 1077-18260, respectively. 

The geographical area is in the Cascades Mountains, Washington, with 
the 1040-18201 image shifted by 21 kilometers north of the 1077-18260 
image. 

Tables 5-10 and 5-11 summarize the results of the clustering 
algorithm operating on data from images 1015-17415 and 1069-17420, 
respectively. The geographical area is around Salt Lake City, Utah, 
with the 1069-17420 image shifted by 6.7 km east and 11.2 km south 
with respect to the 1015—17415 image. 

Examination of Table 5—11 shows that cluster number 4 contains a 
mixture of irrigated farms, forested mountains, urban areas, low mountains, 
hills and desert. It perhaps is an example of a cluster with a small 
number of vectors (the urban class) which has been shifted towards the 
heavily populated classes (Ml and M2). Also, cluster number 6 appears 
to be a mixture of irrigated farms (which are irregularly shaped in 
this part of the country), low mountains with sparse vegetation, hills 
and desert. The category of dry farms with a small vector population 
was recognized as a separate cluster. 

Table 5-10 shows that the clustering algorithm gave better results 
with image 1015-17415. Only cluster number 6 appears to be a mixed one 
and includes the dry farms. The difference between images 1015 (7 August) 
and 1069 (30 September) affecting the operation of the clustering 
algorithm appears to be seasonal. A color composite of 1015 shows 
bright reflected radiation and a dry terrain appearance with only the 
high mountains displaying lush vegetation. A color composite of 1069 
shows low reflected radiation with lush vegetation extending into valleys 
which were dry in the summer. It appears that rainfall during September 
brought out this vegetation (possibly grasses). It also made the dry 
farmed area more distinct from the desert and the irrigated crops 
appeared to be vigorously growing. Thus, it appears that the clustering 
results are influenced by the seasonal multispectral variations of the 
terrain. Classes are combined whenever their seasonal variations make 
them appear less distinct. 

A general observation is that classes with large vector populations 
tend to absorb or shift the clusters of other classes with small vector 
populations. This is to be expected because most clustering algorithms 
utilize the vector density (in the vector space) as a means of forming 
clusters. 

Figures 5-11 through 5-17 show the clustering results superimposed 
as numerical annotation on the corresponding MSS 5 images. 
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NEW TECHNOLOGY 


6 . 


6.1 Spatial Signatures 

A spatial signature for cultivated land was discovered in the 
diffraction patterns of ERTS transparencies and in the Fourier transforms 
of digital data from the ERTS computer compatible tapes. The signature 
consists of two orthogonal rows of frequency spots and is predominant in 
the MSS 5 band. {See Section 4.1,3) The spacing of the frequency spots 
may not be the same in both rows. 

A spatial signature for urban areas was discovered in the diffraction 
patterns of ERTS transparencies and in the Fourier transforms of digital data 
from the ERTS computer compatible tapes. The signature consists of two or- 
thogonal rows of frequency spots from thw MSS 7 band, but is not present in 
the MSS 5 band. 
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6.2 Spatial Features 


The investigation has proven that the four spatial features S , S^t S 3 
and S 4 selected from the Fourier transforms of the digital data contained 
the necessary information for the computer recognition of terrain types 
from a single spectral band (MSS 5). (See Section 4*2) A method was also 
developed to measure the spatial features rapidly by sampling the diffraction 
patterns of ERTS imagery with electro-optic sensors (See Section 4.3). 
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6.3 Non-Linear Transformations of Feature Vectors 


The feature vectors that describe terrain types contain three spectral 
features and four spatial features. It was discovered that the distributions 
of the terrain class vectors were not Gaussian and the recognition accuracy 
of the maximum likelihood algorithm which is widely used was adversely affected 
by the non-Gaussian statistics (See Sections 5.2.1 and 5.2.2). 

Non-linear transformations of the feature vectors were then developed 
which corrected the terrain class statistics and made them symmetrical about 
their means. As a result of the non-linear transformations, the accuracy 
of terrain recognition increased remarkably (see Sections 5.2.3 and 5.2.4). 



6.4 Clustering Algorithm 

A new clustering algorithm has been developed which makes no a priori 
assumptions about cluster distributions. (See entire Section 5.3) The 
algorithm first searches the feature vector space for local concentrations 
of vectors and determines potential cluster centers and their statistics. 

Then, it applies the divergence criterion to test the statistical separa- 
bility of potential cluster centers and identifies the true cluster centers 
from other vector concentrations. Finally, the cluster centers are grown into 
full size clusters through assignment of all feature vectors by the maximum 
likelihood criterion. The end result of the clustering algorithm is the 
determination of the statistically separable clusters and their statistics 
and the classification of all feature vectors. 

The clustering algorithm has been applied on data from seven ERTS images 
and the results have been excellent (see Section 5.3.3). 
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7. 


CONCLUSIONS 


Through the developed algorithms, it is possible to automatically 
recognize terrain types* By using the clustering algorithm that was 
developed, it is possible to achieve high accuracy in the recognition 
process and nearly complete automation since training data is not needed 
to train the algorithm. 

When the correct statistics are employed, the machine recognition 
appears to be more accurate than a human photointerpreter who has been 
constrained to use ERTS-1 color composites only. Stated another way, 
machine recognition appears to be more sensitive and can operate much 
closer to the resolution limit of the ERTS-1 imagery than the human 
photointerpreter. 

The experimental results show that the accuracy of terrain recogni- 
tion is affected by the statistics employed. The accuracy is best when the 
statistics are approximately Gaussian, and it is necessary to apply non- 
linear transformations to the feature vectors to achieve approximately 
Gaussian distributions. The class statistics vary appreciably between 
seasons, such as summer and fall. 

The experimental results also showed that combining the spectral data 
with the spatial features increased the accuracy of the machine recognition - 

The clustering algorithm has produced excellent results and forms 
statistically separable clusters which reflect the scale and resolution of 
the ERTS-1 data. In some cases, it will produce more classes than photoin- 
terpretation of ERTS-1 color composites would suggest. 

8. POTENTIAL APPLICATIONS AND SIGNIFICANT RESULTS 

The results of this investigation are directly applicable to the auto- 
matic production of thematic maps, and in particular, of land-use maps. 
Anderson et al^ have described three levels of land-use classification. The 
terrain classes recognized in this investigation using training data fall in 
the first classification level. The use of the clustering algorithm with 
combined spectral and spatial features can increase the number of classes 
recognized beyond the first level. 


5 

Anderson, J. R. , Hardy, E. E. , Roach, J. T. , "A Land-Use Classification 
System for Use with Remote-Sensor Data", Geological Survey Circular 671, 
1972, Washington. 
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In order to pursue these applications in depth, a proposal for a 
follow-on investigation was submitted to NASA on December 21, 1973. The 
Abstract of this proposed investigation is given below: 

This investigation is primarily concerned with the application 
of automated interpretation techniques to two specific prob- 
lems: (1) the production of thematic maps for two test sites 

in order to evaluate the utility and cost-effectiveness of 
ERTS data for nuclear power plant siting and environmental 
impact studies, and (2) the production of land-use maps for 
two statistical metropolitan areas in order to monitor 
changes in population. The thematic and land-use maps will 
be produced using both multispectral and spatial signatures. 

The interpretation techniques and software developed under 
the ERTS-1 investigation "Automated Thematic Mapping and 
Change Detection of ERTS-1 Images" (NAS 5- 2 1766) will be 
effectively utilized to process MSS data. The thematic and 
land-use maps will be evaluated for their utility and cost- 
effectiveness by co-investigators from the U. S. Bureau of 
Census and the consulting engineering firm of Dames & Moore, 

Ground truth data supplied by the co- investigators and air- 
craft underflight photography will be used in the evaluation. 

The results of this investigation are expected to further the 
goal of utilizing ERTS data to solve problems of national 
importance. 

The most significant results of the investigation are: 

1. The discovery of the spatial signatures for cultivated land and 
urban areas. 

2. The development of algorithms and a software system which recog- 
nizes terrain automatically and accurately from ERTS-1 digital data* This 
software system is now ready to be applied in many applications where the- 
matic maps are needed, including the applications described in the follow-on 
investigation proposal. The most obvious application is the generation of 
land-use maps. 

9 . RECOMMENDATIONS 

It is recommended that the software system and the pattern recognition 
techniques developed under this investigation be applied on many problems of 
national importance for which remote sensing of the earth by the ERTS-1 
satellite can provide very valuable information. Two such problems have been 
identified for the proposed ERTS-B investigation (see Abstract in Section 8) . 
In addition, the following applications are suggested as being potentially 
of high economic value: 
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a. Monitoring of agricultural crops and forest resources to detect 
crop stresses and estimate crop yield. 

b. Uniform land-use mapping for the entire country. 

c. Monitoring of snow fields to predict flooding and estimate 
water resources. 

d. Detection of water pollution. 

While the ERTS-1 satellite has already produced an enormous volume of 
data, the extraction of useful information from the data and its application 
to the solution of problems of national importance has just begun. There- 
fore, in 1974 great emphasis should be placed on applications of the existing 
ERTS-1 data base. In 1972 and 1973, the interpretation techniques of ERTS-1 
data have advanced very substantially to the point that very significant re- 
sults can be expected when these techniques are applied to solve real problems 
Unquestionably, the interpretation techniques need further development and 
refinement, but such development would be more fruitful if accomplished while 
the techniques and the ERTS data are being applied on real problems. 

There are several areas in the interpretation techniques that need 
further development: 

1. Generation of accurate boundaries between recognized classes. 

A lot of work has been done on boundary detection and this 
problem appears to be near solution. 

2. Transformations of the spectral data. It is well known that 
the spectral bands are highly correlated. This data should 
be processed through non-linear spectral transformations so 
that the data would be in optimum form for the clustering 
algorithm. 

3. More research on spatial features should be conducted. While 
the spatial features developed in this investigation have 
produced excellent results, they are not unique. 

4. Development of high speed processing hardware. General pur- 
pose computers are excellent for research on interpretation 
techniques and applications, but are not cost-effective for 
the processing of large volumes of ERTS data. The cost-effec- 
tiveness problem must be solved with high speed processing 
hardware if the ERTS data is to be used widely and economic 
benefits are to be derived from it. 
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